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Abstract 

State estimation is a classical problem in quantum information. In optimization of estimation 
scheme, to find a lower bound to the error of the estimator is a very important step. So far, all 
the proposed tractable lower bounds use derivative of density matrix. However, sometimes, we are 
interested in quantities with singularity, e.g. concurrence etc. In the paper, lower bounds to a 
Mean Square Error (MSE) of an estimator are derived for a quantum estimation problem without 
smoothness assumptions. Our main idea is to replace the derivative by difference, as is done in 
classical estimation theory. We applied the inequalities to several examples, and derived optimal 
estimator for some of them. 

1 Introduction 

The quantum state estimation is a classical problem in quantum information ( [121115) ). It is not only 
useful for various purposes, e.g., evaluation of realized quantum information processing systems, but also 
is a fundamental problem in its own right. 

As is the case in classical estimation theory , often, it is assumed that the unknown source is a 
member of a parameterized family of quantum states, which is called model, to reduce the problem to 
the estimation of the unknown (in general, vector valued) parameter. Sometimes, we are interested in 
some physical quantities of the state, e.g. entanglement of the state. In such cases, the estimation of a 
function of the unknown parameter is considered. 

So far, most of the studies of quantum estimation theory assumed that the model is a smooth surface 
in the space of quantum states, and that the function to be estimated are differentiable up to necessary 
order. Under these smoothness assumptions, tractable lower bounds to the MSE of the estimator had 
been studied, and some of them give achievable bounds ( [121 1151 124) ). Especially, in asymptotic regime, 
we have lots of results on achievable bounds. 

On the other hand, when there are singularities in the model and/or in the function to be estimated, 
a little had been found out, so far as general theory is concerned. It should be stressed that there are 
some practically important examples to which the smoothness assumptions do not apply. For example, 
concurrence, a measure of entanglement, is singular at zero point. Also, considering that the state 
estimation is a fundamental problem of quantum information, it is better to be studied in most general 
setting. 

The discrete models, or the models whose parameter takes values in a discrete set, are only non- 
differentiable models which has been studied intensively, and there are many results in special cases, 
mainly by exploiting symmetry. However, there are relatively a few results which are valid for all 
discrete models. To begin with, it is not straightforward even to work out a non-trivial lower bound to 
the error of the optimal measurement. It is true that the formula in [281 114) , which is a special case of 
complementarity theorem of SDP, gives necessary and sufficient condition for the optimal measurement. 
However, this formula is far from tractable in many cases, for being a system of non-linear matrix 
equations and inequalities. 

1 Throughout the paper, the estimation theory of classical probability distributions is called 'classical estimation'. 
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The purpose of the paper is to give the first general treatment of quantum state estimation without 
the smoothness assumptions, developing quantum versions of Hammersley-Chapman-Robbins inequality 
(03, [E]), Kshirsagar inequality ([E|) and Koike inequality (JHj) m classical statistics. While the lower 
bounds under the smoothness assumption uses differential of density matrix, our new inequalities use 
difference. 

The paper is organized as follows. The Section |3 is preliminaries. We give a brief review on classical 
and quantum estimation theory, and set up the problem. In Section [3J we state our new inequalities, 
whose proofs are given in the Sectional In the Section 0] our theory is applied to several examples. 
Non-asymptotic and asymptotic inequalities are derived, and based on those, optimal estimate are given 
for some of the examples. In Sectional we state the conclusions and the future problems. 

2 Preliminaries 

2.1 Classical estimation theory 

An important problem of classical estimation is to optimize estimator of (a function of) the parameter 
which is assigned for the unknown probability distribution. The error is often evaluated by MSE under 
unbiased condition. Here, the estimator is said to be unbiased if its expectation equals the true value of 
(a function of) the parameter. 

In many cases, the optimal solution is too hard to obtain. However, a series of tractable lower bounds 
to MSE's of unbiased estimators are derived using the first order and the higher order derivatives of the 
probability distribution with respect to the parameter of the model. The inequality based on the first 
order derivative is called Cramer-Rao (CR) inequality (JHDj while those which exploit also the higher 
order derivatives are called Bhattacharyya inequalities ( 4 ). By definition, Bhattacharyya inequality is 
always not worse than CR inequality. 

In general, none of these lower bounds is achievable, but yet there are many cases where one of them 
can be achieved by some estimators. In asymptotic limit, an asymptotic CR inequality gives achievable 
lower bounds in general (PP). Needless to say, these inequalities rely on the smoothness assumptions. 

These inequalities can be generalized to the cases with singularity by replacing (resp. higher order) 
derivatives by (resp. higher order) differences. A difference version of CR inequality (resp. Bhattacharyya 
inequality) is the Hammersley-Chapman-Robbins-Kshirsagar (HCRK) inequality (0, [E], 53) (resp. 
Koike inequality (JE]))- By definition, Koike inequality is always not worse than HCRK inequality. When 
the singularity is absent, by taking difference infinitesimally small, those difference versions reproduces 
the derivative versions. We can also make an asymptotic version of HCRK inequality, but unlike the 
diffcrcntiable cases, this bound is not always achievable (HJ|)- See El f° r more detailed description of 
the theory which allows singularity. 

2.2 Quantum estimation theory with smoothness assumptions (I) 

Quantum versions for inequalities under the smoothness assumptions, i.e., Quantum Cramer- Rao 
(QCR) inequality (|12j. |24| ) and Quantum Bhattacharyya (QB) inequality (|Sj), have already been 
studied a lot. 

One important point is that, due to the non-commutativity of quantum mechanics, there are many 
quantum versions of those inequalities (Quantum Cramer- Rao (QCR) inequalities). A difficulty is that 
there is no 'best' QCR inequality. That is, to obtain the tighter lower bound, one must chose a proper 
version of QCR depending on the estimation problem of interest. 

However, if either the parameter of the model is scalar valued, or a scaler valued function of the 
parameter is to be estimated, an SLD QCR (defined later) gives tighter lower bound than any other 
QCR's. Also, in this case, an SLD QCR gives the asymptotically achievable lower bound to the error of 
estimators, as in the classical estimation theory. 

Now, we state the mathematical detail of the theory. Let Tt be a separable Hilbert space describing 
the physical system of interest. Denote the set of linear operators on TC and the set of density operators, 
by CiTi) and by S(H)(c C(TL)), respectively. Let be a parameter space which is a subset of the set 
R of real numbers, and assume that the density matrices for the system of interest is a member of a 
parameterized family of states, {pe',0 £ 0}, which is called a model. Throughout the paper, except as 
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otherwise noted, we assume 

Pe > 0, (1) 

or the density matrix pg has its reverse. 

The model is said to be differentiable (resp. non-differentiable) , if the map 9 i— > pg is differentiable 
(resp. not differentiable). We would like to estimate g(9) by measuring the system where g(9) is a 
function of 9. An estimator of g(9) is a measurement whose output is an estimate, i.e., a POVM M 
which takes values on g(M). 



Non-asymptotic theory with the smoothness assumptions 

In non-asymptotic setting, we often impose unbiasedness condition on estimators, which is defined 
as follows. 

Definition 1 The bias b(M, 9) of an estimator M at 9 is defined by 



b(M,9)= / (>y-g(0))Tr(j>oM(drf)). 

If b(M, 9) = 0, M is said to be unbiased at 9. If M is unbiased at any 9, M is said to be unbiased. 
The classical CR inequality uses a logarithmic derivative lg(x) which is defined as, 

h{x) := Jjlogp(a;,0), 

or as the solution to the equation 

p(x,9)lg(x) = — — — . 

There are many quantum analogues of a logarithmic derivative, due to the non-commutativity of 
quantum mechanics. Among those, a Symmetric Logarithmic Derivative (SLD) and a Right Logarithmic 
Derivative (RLD) are most frequently used. 

Definition 2 An SLD L s e e C(H) is defined as a solution to the equation 

d_ peLj + Llpe L s {L sy_ 
d9 P 2 ' e K e> 

An RLD Lf 6 £{T~t) defined as a solution to the equation 

Y d Po = Pe L e- ( 2 ) 

An SLD and an RLD are uniquely defined since pg is invertible by the assumption Q. The perfor- 
mance of an estimator M of g(9) is evaluated by the MSE 

Vg(M)= f { 1 ~g{9)fTr(pgM{d 1 )). 
J-yeg(.e) 

If the bias b(M, 9) is exactly zero, MSE is equal to what is called variance. An SLD and an RLD 
Quantum Cramer-Rao (QCR) inequality are stated as follows. 

Proposition 1 /|12| ) If an estimator M is unbiased, it holds that 

Vg(M) > (g'(9)) 2 /Jg S > {g'{9)f/Jg R (3) 

where 

J$ = Tr(pg(Lm J S R = Tt(peL?(Lf)*). 

The equality in the first inequality in |3J) holds if and only if M gives the spectral decomposition of a 
hermitian matrix T such that T — g(9)I € span{Lg }, where I is the identity on H. 

The proof of the proposition is reviewed in Section 

See f° r classical version of the theory, |121 1151 12"1) for the quantum theory. 
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Asymptotic theory with the smoothness assumptions 

Assuming that n identical independent samples pf n of unknown states pg is given, let us analyze the 
behavior of estimators as n tends to infinity. An estimator is a POVM M living in 7i®". If the smoothness 
assumptions hold, the mean square error converges to zero with the order of 0(l/n). Therefore, we focus 
on the coefficient of the 1 /n-term. 

In asymptotic analysis, the unbiasedness condition is replaced by asymptotic unbiasedness defied as 
follows. 



Definition 3 For the sequence M n of estimators of g{9), if the bias b(M n , 9) satisfies 

= o(l/VH), 



b(M n ,6) = o(l/V^), ^-b(M n ,x) 
ax 



M n is said to be asymptotically unbiased at 9. If M n is unbiased at any 9 £ 0, M is said to be 
asymptotically unbiased. 

The Asymptotic Quantum Cramer-Rao (AQCR) inequality is stated as follows. 

Proposition 2 Jffi) Assume that M n is asymptotically unbiased. Then, we have 

lim nV e (M n ) > (g'(9)f/J$ > {g\9)f/J B R . (4) 

n — >oo 

Especially, the first inequality is achieved by some asymptotically unbiased estimators. 

2.3 Quantum estimation theory with smoothness assumptions (II) 

In this subsection, we review quantum estimation theory for vector valued parameters with smooth- 
ness assumptions. For simplicity, here we concentrate on the estimation of the parameter itself, and do 
not treat the estimation of g(9). 

Let us denote by m the dimension of the parameter 9 = (9 1 , ■ ■ ■ 9 m ) (m = 1 if 6 is scalar), and define 
SLD Lf e of 9 % and RLD Lf 6 as the solution to the equations 

d 1 

-QQ-Po = -^{hfiPe + Peh,e) = PeL? fi - 
Define also matrices J g and by, 

Jo,i,j = -^PeiLfeL^-Q + Lj e Lf g ), jf iit = TipgLf e {Lf e )\ 

respectively. 

Proposition 3 (W%) A ssume that the estimator M is unbiased (for each 6 % ). Then, for any real valued 
positive matrix G, we have, 

SpGVg(M) > SpG(J e s )-\ 

SpGVe(M) > SpGiJ^y 1 + Spabs3G(J e i? )~ 1 . 



The proof is reviewed in Section |SJ Here, Sp is the trace over the vector space C m , and SpabsA is the 
sum of absolute values of eigenvalues of A. Typically, G is chosen to be diag(gi, ...,g m ), and in that case 
SpGVg(M) is a weighed sum of MSE's of the estimators of components of 9. 

Different from the scalar valued parameter case, we cannot say which one is better in general. How- 
ever, if [Lfg,Ljg] = 0, SLD QCR gives better lower bound, and is asymptotically achievable by the 
estimator based on the simultaneous spectral decomposition of SLD's. If LSD's are not commutative, 
RLD QCR will be often useful. 
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3 New inequalities for non-differentiable models 

3.1 Non-asymptotic theory without the smoothness assumptions 

If pg and/or the function g{9) are not differentiable with respect to 9, then the inequality does 
not make any sense. In classical statistics, Hammersley and Chapman and Robbins [H] have derived 
an inequality using the one-sided difference in place of the derivative, and Kshirsagar |17j improved it 
by the two-sided difference. We generalize their idea to the quantum theory. The proofs of the theorems 
are given in Section [SJ 

Let T be the linear space of M-valued functions of 9 G O. For f{6) G T and for 5 G M, let As be a 
linear operator defined by 

A t m = f(0 + td)-f(0-(l-t)6) _ 

We next define operators, which are analogues of SLD and RLD. 
Definition 4 For 5 G R, define if ! |(= (L^) f ) and Lg'g by 



T S ,t . J- S \t 

A t Pe L e,s + L e^ At T R,t 



(La'l and L^ s are uniquely defined since pg is invertible by the assumption l[TJl.) 

Remark 1 If pg is differentiable with 9, then Lg'l — > and L^* — * as 5 — ► since A^pg — * {d/d9)pg 
as <5 — > 0. 

If the model is differentiable both from the left and from the right, we have, 

L e;t = tLS et + 0- ~ *) L 0'~> L M = tL e} + + (1 " (6) 



where L„ ,=t and £? ,=t are defined by, 



(Observe that 



5^ = 2^ PeL «' + L ® Pe ^ = PeL 



lim A sPe = lim iAjp e + lim {l-t)A a sPe . 

5^+0 8->+0 



) If the singular model of our concern is embedded in the larger smooth model, then L^' :± and L^'^ are 
calculated from SLD and RLD of the larger model. 

Our first result, Quantum Hammersley-Chapman-Robbins-Kshirsagar (QHCRK) inequality, is given in 
the following theorem. 

Theorem 1 If the estimator M of g{9) is unbiased, then 

Vg(M) > [A\g{9)f/J s e f s (7) 
> (A^(0)) 2 /J$ (8) 

where 

J S 6 f s = Tr( Pe (L^) 2 ), J% = Tr(poL% s (Lffi). 

The equality in £Q holds if and only if M gives the spectral decomposition of a hermitian matrix T such 
that T — 91 G span{Lg'^}. 

We next present Quantum Koike (QK) inequality which improves QHCRK. For a function f(9) of 
9 G O, for a real number 5 and for an integer k > 1, we define fc-th difference operator A^ft by 

A 5 , k f(6) = (-l) k j: J2(-iy ( k ) f(9 + Si). 

Note that, if j{9) is fc-times differentiable, then As^ k f(9) — > (d k /d9 k )f(9) as 6 —> 0. Let us define Lf s k 



and Lf s k as follows. 
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Definition 5 Define Lg s k and Lf s k by 

A n P gL ls,k + L ls,kPe . s , s \ 

&s,kPe = p s Lf Sk . 

Let us define r x r matrices Kg g — (Kg 5 i J and = (K^ i .) 
(i, j = 1, ...,r) by 

tfflUij = ^{PeLlsA,^ K« M = Tr(pgL^ Al (L^ s J), 

and let v = t (Ag t s,ig(6), Ag ! s <r g(d)) be a column vector. We then have the following Quantum Koike 
(QK) inequality which generalizes and improves QHCRK inequality of Theorem 

Theorem 2 If an estimator M for g(9) is unbiased, then 

Vg(M) > V^rV (9) 
Vg(M) > \(K^ S )~\, (10) 

where (Kg s )~ 1 and (Kg g)^ 1 are generalized inverses. The equality holds if and only if M can be written 
as an observable T such that T — 91 G span{Lg s 1: Lg s k }. 

Remark 2 We can derive the Bhattacharyya inequality by Theorem 



3.2 Asymptotic theory without the smoothness assumptions 

If the smoothness assumptions are not valid, MSE Vg(M n , 9) of appropriate estimators M n does not 
necessarily scale in 0(\/n). ([21] discusses this point in detail in the classical estimation theory.) The 
convergence rate was expressed by an increasing sequence c„ that goes to infinity as n — > oo. 

In this paper, we study the following two cases. 

Case 1 The parameter of the model takes discrete values, and each point in the model is isolated, and 
9(0) = 0. 

Case 2 The model and the function g(9) are continuous, 

In Case 1, if the model has only two density matrices, the estimation of 9 seems quite close to test of 
simple hypothesis about 9. We discuss this point later. In both Cases 1 and 2, we use an RLD-type 
inequalities, since SLD-type inequalities arc mathematically intractable. 



Discrete case 

If the set of parameters is discrete and each element of Q is isolated (i.e., there is a positive real 
number c such that, for any 9and9' € 0, it holds that \\pg — pe'\\tr > c )i then the convergence rates of 
the bias and the variance of appropriate estimators are exponential. 

Assume that 9,9 + 6 e 0, and that there is no element of C R between 9 and 9 + 6. We also 
assume that g(9) ^ g(9 + 5). Then, we have the following asymptotic QHCRK (AQHCRK) inequality 
for the discrete case. 

Theorem 3 Assume that there is uq such that, if n > n$ then b(M n , 9) and b(M n , 9 + 6) satisfy 
\b(M n ,9)\ \A 5 g(9)\ \b(M n ,9 + 6)\ \A s g(9)\ 

-^6^-^^' and \6\ ^—3— ■ (U) 

Then 

liminf - \og(V e (M n )) > - log(l + 6 2 jfj). (12) 
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Proof Observe 

Trp^p 2 e+S = l + 6 2 J^ S \ (13) 
Replacing pg and pe+s in (|13[1 by pf n and pf? s , we obtain, 

Tr( P rLff\Lff^) = ^{Tr^™)- 1 ^-) 2 - 1} = 1 pl +S T l) 

= ^(( 1 + ^ 1 )"- 1 )' ( 14 ) 



where the last equation is due to l|13(l . 

The assumption (|11|) and the formula l|28|) imply that, for n > no, 

-logVe(M n ) > -2log\A s b(M n ,e) + A s g(d)\ - - logTr(pf n s L B e f \hf <">)t) 

Ti ft TL 

no n o A 

. l. |A gg (g)| 1 (l + ^ 2 jff)"-l 
> -2 log log f 

- -log(l+5 2 J e y) (n-oo). 

Hence (JEJ holds. □ 

We would like to remark that log(l + 8 2 J^f) is always not smaller than the relative entropy D(pg + $ | 
pg) defined as 

D{pe+s\\pe) = Ti(p e+s (logpg +s - logpe)). 

The reason is the following. Consider a model O = {9, 9 + 8}. Due to the theory of quantum hypothesis 
testing (^21 savs that HOI)? f° r anv e > 0, there is a sequence N n of POVM's taking values in R such 
that 

Tr(pfZ 5 N n (9)) < J^£|M, Urn I logTr(pf n N n (9 + 5)) = -D(p e+S \\ pg). (15) 

We can regard N n as an estimator of 9 satisfying pifl. Indeed, there is no such that, if n > no then 

\b(N n ,9)\ _ \8\Ti(pf n N n (9 + 5)) \A s g(9)\ \b(N n ,9 + 8)\ _ \S\Tr(pf^ s N n (9)) \A s g(9)\ 
\S\ \S\ " 3 ' \6\ \5\ " 3 ' 

Moreover, 

UogV e {N n ) = ±log((g(0 + 5)-g(6)) 2 Tr(pf n N n (9 + 6))) 
-> -D(p e+S || p e ) (n -> oo). 

Due to JEJl, we have D{p e+S \\pg) < log(l + S 2 J e R ' s 1 ). 

Continuous case 

Let Lg'^'* be the solution to the equation 

^sPg — Pg ^g,s ■ 



For heR,let 6 — h/c n . Define that 



1 

- 72 

h— >+0 n^oo <- ?! 



= Hmsuplimsup^Tr(pf n L^*jLf;™^)t), 



g' t {9) = limA^fl). 
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If pe is differentiable from the right (resp. left), J^' 1 (resp. J^'°) is equal to where the two-sided 
derivative in J5J) is replaced with the right (resp. left) derivative. This is shown as follows. Replacing 5 
with h/s/n and c n with s/n in (|14J) . we have 

T n,i , , a + ^wn/") m -l r 
J 9 — lim svip lim sup — = Jim sup 



h — >-\-0 n — t-oo 

= J?. (16) 

(The left differentiable case is also given in a similar way.) 

Next, we define asymptotic unbiasedness for a sequence M n of estimators of g(9) based on measure- 
ments of pf n as follows. 

Definition 6 A sequence M n of estimators of g(9) is said to be c ra -unbiased at 9 from the right if 

c n b(M n , 9 + h/c n ) — > as n — ► oo 
for any h > which is small enough. Similarly, M n is said to be c n unbiased at 9 from the left if 

c n b(M n , 9 — h/c n ) — > as n —* oo 

for any h > small enough. 

Then, we have the following asymptotic right (resp. left) QHCRK inequality. 

Theorem 4 Assume that M n is c n unbiased at 9 from the right (resp. from the left), and J^' 1 < oo. 
Ifi 1 ! (#)| < oo ( resp. J^'° < oo, |.9o(^)l < °°)- Then, 

limmi c 2 n Vg(M n ) > (g'M) 2 / J? 1 (17) 

n— »oo 

( resp. 



iimMc 2 n v e (M n )>(g' (e)yj^ ; 

n — >oo 

The proof is given in Section 

Remark 3 Due to l|16|l . theorem 0] implies AQCR inequality. Notice that this argument is technically 
much more complicated than the proof of the similar statement in non-asymptotic theory. In fact, it is 
not straightforward to prove a similar relation between asymptotic SLD-base inequalities, for the absence 
of the equivalence of the equation (|14(l . 



Define an operator A' s by, 



3.3 Non-asymptotic theory for a model with a vector valued parameter 

v* 

f{9\ g + tS, ...,9 m ) - f(9\ g - (1 - t)S, ...,9 m ) 

A i,5/(, y ) = g 1 

St i?, t 

and difference version of SLD L i ' e s and RLD L i g s is defined as the solution to the equations 

At *-( T S,t . r S.t \ T R.t 

\sPe = 2^ L i,0,sP e + P e he,s> = Pe L i,e,s- 

The following relation will be useful for the further analysis. 

For t — {ti, ...,t m ) (0 < U < 1) and S — (Si, ...,S m ), define matrices J„'t> J^'x by, 

jS,t 1 m / r S,ti rS,tj j-S,tj j-S,ti \ tR:^ rjn tR^j { T R,ti \f 

Analogically to models with scalar parameters, we have the following theorem, which is proven in 
Section 

Theorem 5 If the estimator is unbiased, for any d and for any t such that < U < 1, we have, 

SpGVe(M) > SpG(jJj)- 1 , 
SpGVg(M) > SpG(J^)" 1 + Spabs3G(J*£) _1 . 
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4 Examples 



In this section, we will demonstrate our theory in several examples. 

4.1 Estimation of concurrence 

For — 1 < 9 < 1, let pg be a density matrix on a 2 x 2 system given by 

w = ^l* + )<* + l + ^l*-><*-| 

where 1$*) = (|00) ± |ll))/v2 are maximally entangled states which are mutually orthogonal. This 
type of state can be produced by parametric down conversion f |18|L 

Let g(ff) = \9\, which is equal to the concurrence (|22|). a measure of the quantum entanglement 
between two 2-level systems. At 9 ^ 0, we can define the usual SLD and RLD, and both Jg and Jg are 
given by 

T s _ t r _ 1 



I — 9 2 ' 
At 6 = 0, we have 

T S,1 _ jR„l _ 1 
J 0,5 - J 0.S - ]_ _ 02 

for any —1<6<1. Therefore, QHCRK inequality is 

Vg(M) >l-9 2 . 

In this model, however, there is no unbiased estimator for g(ff) = \9\. Therefore, the inequality is 
nonsense in the non-asymptotic setting. 

Let us consider the asymptotic case, where we construct a sequence M n of estimators of g{9), which 
achieves the asymptotic AQHCRK bound. Here, c n = s/n. The AQHCRK inequality for estimators 
which are -^/n-unbiased from the right is given due to (|16|l 

lim c 2 n V 9 (M n ) >l-9 2 . 

n — >oo 

This bound is achieved by the following scheme. 

Step 1 For i = 1,2...., n, we measure i-th subsystem of H® n by a two valued POVM {|$ + ) (<&+ 1, / - 
|$+)($+ 1}. Let Xi = 1 if |$ + )(<I> + | is observed, and let x l = if I - |$ + )(<I> + | is observed. 
Step 2 Let 

Yh=i x i/ n if E"=i x i/ n ^ -n~ 1/3 , 
K ~ E"=i x il n if E"=i Xi/n < -n- 1 / 3 . 

The expectation Eg(y) of y for pf n is 9 + o(l/n) as n — > oo so such an estimator M n is 1/ -y/n-unbiased 
at 9 from the right, and V$(M n ) = (1 - 9 2 )/n. 

4.2 A discrete model 

This example can be regarded as a non-commutative version of the classical model of discrete uniform 
distributions. 

Define 2x2 matrices a\, o-i and O2 as 

01 = (0 0) ' ° 2 = (1/2 ' ° 2 = (0 

Let Tt be an infinite-dimensional Hilbert space spanned by |1), |2), |3), .... For 9 € Q(= N: the set of 
natural numbers), let pg G S(H) be a density operator of the form 

'0 _1 diag((T2, ■■•,02,0-1,02,02, ...) if 9 is odd, 

(0-l)/2 

_1 diag(a 2 , cr 2 , 2 , 2 , ...) if 9 is even 



V = 



0/2 

where diag(...) means the diagonal matrix. 
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Remark 4 Though pg is not invertible on H, the assumption (JIJ essentially applies to this example. 
Indeed, for the achievability of the lower bound, it is sufficient to consider the subspace H.' g (c Ti) 
supported by pg. 



One-sample analysis 

When 9 is even, there is an estimator of 9 which minimizes variance. Define a hermitian matrix T by 

T = diag(T 1 ,r 3) T 5 ,...) 

where, for an odd number i, 

T -( 2i ~ 1 V2 \ 
* ^ 1/2 2i + 1/2) ' 

Let the spectral decomposition, described by M, be the estimator of 9. Since 

Tr(pgT) = 9 for any 9, 

M is an unbiased estimator. 

If 9 is even, then M is optimum. The proof is as follows. Define Ao,Ai, Kg-x by 

p e k k + A k p g 



that is, 



Since 



Pk 



(fc = 0,l,. 



1), 



:diag(l,..., 1,0,0,...) 



A,- = I 



e-i 



-diagf 1,..., 
i \v — „ — 



-i/6 i/r 

1/3 5/6, 



,0,0,. 



if i is even, 



if i is odd. 



A_i, 



peLl_ hk + L s g _ lk pg 



2 

p«((-i) fc Eto(-i) l (-)A0 + ((-i) fc ELo(-i) l (-)A0^ 



we have 



^0,-1, k 



-i)*£(-i)' ; A 



i=0 



That is , there is a triangle matrix U with non-zero diagonal elements such that (L g _ 1 , Lg _, , . . . , Lg _ 1 
[7*(Ao, Ai, Ag_i). This means that {Lf _ 1 fc } and {A^.} have a one-to-one correspondence. Since T 
is an element of span{A , Ai, ...,Ag_i}, T — 8 e spanjLg _ 1 1 , ...,Lg _ x g^}. Hence the equality in © 
holds. 

By direct calculation, the MSE of M is given by 



Ve(M) 



6» 2 /3- 7/12 + 1/(26*) if 6> is odd, 



073-7/12 



if 6 1 is even. 



When 9 is odd, a similar argument of _ x k implies that the best estimator should be given by the 
spectral decomposition of T" of the form 



where 



The MSE of the estimator is 



T' = diag(Ti,T 3 , T e _ 2 , T' g , T e+2 ,...) 

T , _ {20 - 1 
" 1 29 + 1 



9 2 /3 - 7/12 + 1/(461) 

for odd However, T' depends on the unknown parameter 9. In this sense, there is no estimator which 
is uniformly achieves the QK bound. 
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Asymptotic analysis 

Let us consider the asymptotic setting, pg = pf n . Since this model is discrete and each element pg 
is isolated, due to Theorem^ we have the following lower bound to liminfn^oo log Vg(M n ), 

R1 _ Jlog(0/ (9 — 1)) if 9 is odd, 

gl ]tog(0/(0-l))+log$E?j if Sis even. 

Based on asymptotically optimal test between two hypothesis, we will construct an estimator M, which 
achieves the AQHCRK bound to the discrete case of Theorem if 9 is odd. 

M is constructed as follows. First, let ©' be the set of positive even numbers. Next, let M' be a 
POVM of the form 

M(0') = diag(O,...,O,J 2) O,O,...) 

0'-2 

where 1% is the 2x2 identity matrix. We independently apply M' to each sample, and let 9[ € ©' 
be the observed value for the i-th sample. Let # max = maxf^, ...,9' n }. Our next step is estimation in 
a submodel O' = {0 max , 6> max — 1} C ©. Let N n be a POVM given in (|15(1 where 9 = # m ax — 1 and 
9 + S = 9 max . Since each M'(9') commutes with pg _ 1 and pg , due to |T2|, M' and N n commute. 
Applying such a measurement N n , the estimate 9 of 9 is given by 

a _ f^max - 1 if 4iax - 1 G 8' is observed, 
|# ma x if # max G 6' is observed. 

The exponent of MSE of this estimator M is given by 

liminf UoMM) = h D( f^ I »> + if 6 iS ° dd ' 



where 



if 9 is even. 



D{pe~i || pe) > - log(l + Jg ^x) if & is even 

- log —= 

1 8 V3 



6 1 2 

1 II Pe) = log 7 — 7 + 7 — 7 lo S 



4.3 Submodels of Gaussian state model 

We consider singular submodels of Gaussian state model, whose theory f |241 115p is reviewed in the 
next subsection. In last subsection of this section, we study a vector valued submodel of Gaussian 
state model, to sec the effect of non-commutativity explicitly. Before that, for preparation, we study a 
submodel of this model, which has scalar valued parameter. 



Gaussian state model 



For simplicity, we chose the unit such that ft = 1 . Gaussian state model is a family of states living in 
the Fock space, TC = span{|n); n = 0, 1, 2, • • • }, such that 



dpdq 



1/21 \p,q)<J>,q\, 



27T((7 2 - 1/2) 

where \p,q) is a coherent state. After some calculations, one would obtain f|TS]L 



l {P-9% Ll e = \(Q-e 2 ), 



a 4 - 1/4 



a 4 - 1/4 



J - a 2 



1 
1 



r« — 



a 4 - 1/4 



2 
r 2 



(18) 
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The submodel {p^ 9 i s iyQ 2 G ^} and the submodel {p%i e 2y9 1 G M} are exponential families, and 
the equality in the SLD QCR inequality, 

V B (M) >a 2 . 

is achieved for both models, the former by the spectral decomposition of P, and the latter by the spectral 
decomposition of Q. 

For the model {pf;9 G M 2 } with 2-dim parameter, it is known that the equality in the RLD QCR 
inequality is achieved. Namely, with G = I, the RLD and the SLD QCR inequality is given by 

SpV e (M) > 2a 2 + 1, 
SpV e (M) > 2a 2 , 

respectively. For SLD's, being scaler multiplications of P and Q, are not commutative, the achievable 
bound is larger than the bound by SLD QCR. 

If the thermal noise is very large and thus a is very large, the difference between SLD QCR and RLD 
QCR is negligible, meaning that the effect of non-commutativity is relatively negligible. 

Letting \p, q) be a coherent state, the optimal estimator is 

{\e\e 2 ){e\e 2 \\. 

This POVM is often referred to as optimal simultaneous measurement of P and Q, for this satisfies the 
relation 

J^p\p,q){p,q\=P, J^q\p,q)(p,q\ = Q, (19) 
and gives the smallest sum of MSE's in all such POVM's. 

A singular submodel with scalar parameter 

The model {p g ; 9 G K}, where 

_ / P(o,o) ® Pfe,o) (0 < °) 
96 \ 4,o)^(o,o) (*>0) ' 

has a singularity at 6 = 0, and the singular point is differentiable both from the left and the right. In 
the following, we denote by Pi and Qi the quadrature operators of the mode i. 

SLD QHCRK inequality for 9 ^ coincide with SLD QCR inequality, and is given by, 

Vg(M) > a 2 (0^0). (20) 

The equality is achieved by the spectral decomposition of P2 (resp. Pi) if 9 < (resp. 9 > 0), in this 
region of the parameter. However, neither Pi nor Pi is unbiased all over the model. 
At the point 9 = 0, due to © and (|18[) . we have 

hm _ L S e l = \{tP 1 + (\-t)P 2 ), 

and letting t = |, we obtain the lower bound, 

V (M) > 2a 2 . (21) 

The estimator given by the spectral decomposition of Pi + Pi is unbiased at all 9 G G, and achieves the 
equality in the inequality l)21[l. This estimator, however, does not achieve the lower bound by 1)20(1 at 
9^0. 

We conjecture, however, this estimator is optimal, and that the lower bound (|21l) is a lower bound 
all over the model. Indeed, we can improve the lower bound l|20|l as follows. Almost in parallel with 
Theorem 13 we have the following Koike type inequality: 

VJM) > (1 1)( Tl P eL *i( L *iy ^PoLfiiLf;^ \ f 1 A 
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0.5 1 1.5 



Figure 1: A Koike- type lower bound is plotted against Si, when a = 1 and 9=1. With optimal 6±, the 
new equality improves SLD QHCRK inequality with 8 — > 0, which equals 1 (= ct 2 ). 



Let us chose infinitesimally small 82, or take the limit 82 — > 0. Then, the right hand side is, 



1 (8 



\2 



(23) 



J 1 e J((fl) a +(*i) a )_i-(0)2 l 7' 

with J's being the (1, l)-component of for the model {pf; 9 S R 2 }, which is given in the equations @ 
The detail of the calculation is omitted, but the key is the following equality: 

^Po^P( x ,y)P( z , w ) = e J (-+^)-^-«*), (24) 

with A being the (1, 2)-component of J@ for the model {pf; 9 E M 2 }, which is given in the equations 118fl . 
The rough derivation of the equation l|24|) will be given in Section [3] 

Let us chose 81 to be very small, or take the limit Si — > 0, and consider the case where ^5- <C 1 and 
■ir <C 1 . Then, the bound <|23|l is very close to the bound jU , which we conjecture is the tight bound 
all over the model. At least, as in Fig. the new lower bound improves the bound by SLD QHCRK 
inequality with 8 — > in some cases. 

A singular model with the vector valued parameter 

The model {p e ;9 eR}, where 

P§,o)®/>(kn (^<O,0 2 <O) 
, pgio)®P§*<) (^>O,0 2 <O) 
p| 92) ®/f 91)0) (^<O,0 2 >O) ' 
/fflV»)®pg,o) (^>O,0 2 >O) 

has singular point at = 0, and the singular point is diffcrcntiable both from the left and the right. The 
parameter space is broken down into the following three parts according to types of singularity. 

(1) 9 1 ^ and 6 2 ^ 0: differentiable 

(2) 6 1 ^ or 6 2 ± 0: differentiable with respect to 6 1 or 6 2 . 

(3) 1 = and 9 2 = 0: un-diffcrentiable with respect to 9 1 and 9 2 . 

In case of (1), QHCRK inequalities for 9^0 coincide with their differentiable versions (QCR's). 
Namely, in the region {9; 9 1 < 0, 9 2 < 0} the bound by the RLD QHCRK inequality 

SpVe(M) > 2ct 2 + 1 (25) 

is achieved by the estimator {{9 1 , 9 2 ) (6 1 , 9 2 \ (g> /}, where \9 1 ,9 2 ) is a coherent state. The similar holds 
for the region {9;9 1 > 0, 9 2 > 0}. In the region {9;9 1 < 0, 9 2 > 0} the bound by the SLD QHCRK 
inequality 

SpVg(M) > 2cr 2 
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Figure 2: The lower bound by the QHCRK inequality is plotted against t 2 and a 2 . When a = h (no 
thermal noise), the largest lower bond is given by t% = 0. As a 2 tends to larger, the peak moves towards 
t 2 = \- 



is achieved by the simultaneous spectral decomposition of P 2 ®Q\. The similar holds for the region 
{6; 9 1 > 0, 9 2 > 0}. However, none of those 'locally optimal' estimators is unbiased all over the model. 

Next, we consider case of (2). Namely, we consider the region {O;!) 1 < 0, 2 = 0}. (Other regions 
in this case are essentially the same.) If 6* 1 < and 2 = 0, due to © and (|18|l . we can compute 
lim,j^o+ -^f '$ hm5^o+ Lf'g'g (i — 1,2), explicitly, and the RLD and the SLD QHCRK inequality is 
obtained as, 



{° 4 



SpVe(M) > — ^ 7 — w {2a 2 (4 - t 2 + 1) + 1 - t 2 }, 

^2 



a 4 (2t 2 - 2t 2 J- 1^ - il^i 



S ^ M ^ 2t 2 -2 h + l + ^ 

respectively. 

Now, our concern is the value of t 2 which gives the largest lower bound. If a is very large (thus effect 
of non-commutativity is negligible), then the right hand side of the RLD QHCRK is almost the same as 
the right hand side of the SLD QHCRK, and gives the largest lower bound at t 2 = 1/2. On the other 
hand, let us consider the opposite extreme case where the noise is purely quantum, that is, the Gaussian 
states are coherent states. This case is obtained by letting a 2 = 1/2. If and only if t 2 — 0, the right 
hand side of the RLD QHCRK gives the non-trivial lower bound, 

SpV e (M) > 2. 

(If t 2 0, the right hand side of the inequality vanishes.) 

To sum up, the value of t 2 which gives best bound is dependent on the non-commutativity inherent 
in the model. 

Finally, the case of (3) is analyzed. Here, we have Due to © and IjlSJ) . lim,5^o+ Lfe^ (i = 1,2) are 
computed explicitly. Letting t\ = t 2 = 1/2, we have the following RLD QHCRK inequality, 

SpVe(M) > 4ct 2 + 2. (26) 

This choice of the parameters t\ , t 2 is optimal, for the unbiased estimator which is constructed below 
achieves this lower bound. 

Let P x . = Q x , = j Pv = Q2m = Qi^ch^ Thenj Pi ^ Qtr satisfies the 

canonical commutation relation, and Pi* , Qj* (i ^ j) commutes. Consider the new split of the system 
into the mode 1* and 2*, introduce a new coherent state \p, q}* with respect to P\* and Qi», and define 
an estimator 

{I07VM7V2)* *{o l /^,e 2 /y/2\®iy 
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Figure 3: RLD QHCRK bound with S i = 20* ((a)), with 5 i -> ((b)) are plotted against (0* < 0). (c) 
is the conjectured lower bound (and is RLD QHCRK bound at = 0). (a) is above (b) in some region, 
and is very close to (c) around the origin. 



Observing that the expectation of P1+P2 and Q1 + Q2 coincides with 9 1 and 8 2 respectively, unbiasedness 
of the estimator is checked by using (|19|) . A straight forward calculation shows that this estimator 
achieves the lower bound implied by Ij26|l . 

This estimator is optimal at = 0, and unbiased all over the model. We conjecture that the lower 
bound at other points of the model is improved to show the optimality of this estimator. 

In fact, for example, the lower bound (|25|l is improved by using RLD QHCRK inequality with a finite 
5. Let 9 1 < 0, 6 2 < 0, and 6 X = -9 1 + t, 5 2 = -6 2 + s, for s > 0, t > 0. Then, due to (J2H, the RLD 
QHCRK inequality is calculated as, 

S P V e (M)> a + d+2lcl 



ad — b 2 — c 2 ' 



Here, letting J and A be the (1, 1)- and the (1,2)- component of the jf for the model {pf;9 € K 2 }, 
respectively, 

eJUey+W 2 ) _ 1 r /((e 2 ) 2 +(s) 2 ) _ 1 , „, , 2 „ 

a= 6 - ^ 2 , d= e ——- 6 + V^Tc = e -^ ts+ele2 »-l. 

{0 1 +t) 2 (9 2 + s) 2 

When t«1 and ^Qr < 1 (i = 1,2), letting t — 9 1 and s = 9 2 , the bound is nearly 4c 2 , and is close 
to 126JI . which is conjectured to be a lower bound all over the model. 

As is indicated in Fig. EI w e can observe in some region of the parameter space, the new bound, RLD 
QHCRK bound with S i = 29 l (8 l < 0) improves JUJ). 

5 Proofs 

In this section, we give proofs of theorems. 

First, we prepare a lemma which says that the best unbiased estimator is a projection valued measure 
(PVM). 

Lemma 1 Let 

T= / 7 M(d7). 
•/769(e) 

Then, it holds that 

7 2 Tr(p e M(d 7 )) > Ti{ Pe T 2 ). 

763(e) 
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Proof. 

Hence we use an observable T to describe the POVM M in Proofs of Theorems ^ an d 12 



/ 7 2 Tr(p e M(d 7 )) - Tv(p e T 2 ) = Tr(p e [ (7 - T) 2 M(d 7 )) > 0. 



5.1 Proof of Theorem [T] 

Let T £ C(H) be an operator satisfying Tr(pgT) = 5(0). By Schwartz's inequality, it holds that 

Tv( Pe (T ~ g(e)) 2 )TT( P9 (L s e ) 2 ) > (Tv( P9 {T - g{e)L s e )) 2 

= (TT((T-g(6))(A 5 pe))) 2 = (A s g(8)) 2 . (27) 

Therefore, 10 holds. In addition, 

< Tr(p e (Ll 5 -L* 5 )(Ll s -LK s y) 

= Tr(p e (Ll s f) - 2Tr(Ll s A sPe ) + TcfaLf^Lfo?) 
= -Tr(p e (Ll s f) + Tr(p e L^(L^n 

Hence, (JHJ) holds. The equality in J23 holds if and only if T - g(6) = cLjj for a constant cel. □ 

5.2 Proof of Theorem H 

. Let 

v= t (T-g(0) ) L§ A1 ,L$ A i,... ) L$ Ak ). 

Let V be a k + 1-dimensional C- vector space, Then, v can be regarded as an element of V (g> C(TL). Let 
Iy be the identity matrix on V. Let P = pg Iy . It holds that 



0<Tr w (P^ )= ^(M) ^^^^ 



where Tr-^ means the partial trace over Ti. Consequently, we obtain 

V e (M) - ^(K s )-\ = (1 *v(^)- 1 ) A ( {K s)-x v ) > 0. 
This implies By a similar argument, (|10(1 is also shown. □ 

5.3 Proof of Theorem |U 

In a similar way to (|27J) . Schwartz's inequality implies that 

(A h/Cn b(M n ,0) + A h/Cn g(6)) 2 



<V e {M n ) > 



> (A h/Cn b(M n: 8) + A h/Cn g(e)) 2 



and hence (|17[) holds as n — > 00. 
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5.4 Proofs of proposition |3] and Theorem [5] 

Let 

(A, B) hp = ±Trp{BAi + A*B), (A, B) i>p = ^p(BA^), 
X = {X\X 2 ,...,X m ) T , Z P (X) = [(X\X>) 2 J. 

Observe that, 

nz p (x) = i(x\xJ) hp }. 

Lemma 2 

V e (M) >Z p (X), V e ( M ) > m p (X) . 

Lemma 3 Let (•, •) be an inner product, and X = (X 1 , X 2 , X m ) T , L — (Li, L 2 , L m ) T , where 
X , X 2 , X m , Li, L2, L m are members of the vector space of our concern. Denote matrices [{X 1 , X 3 )], 
and \{Li,Lj)\, by and T^, respectively, and assume [(X l ,Lj)] = S*. Then, T^ 1 and TZ 1 exists, and 

holds. 

Lemma 4 For any positive hermitian matrix Z , we have 

min SpGV = SpGZ + SpabsSGZ, (29) 

where V runs over all positive real matrices such that V > Z . 
For the proofs of Lemmas 121 II see JH] ■ 



Let (•, •) be (•, -)i, Pe and let Li be Lf g (resp. L. ' g ) in Lcmma|31 and combine with Lemma|2]and 
Lemma 0| Noticing that QJg = (resp. SjJ'^ = 0), then, we have the first inequality in Proposition 
(resp. the first inequality in TheoremEJ. Similarly, letting (•, •) be (•, -)2, Pe and letting Li be Lf g (resp. 
L R gg) in Lemma |3 we have the second inequality in Proposition [3] (resp. the second inequality in 
Theorem |5j) • 

5.5 Derivation of the equation (1241) 

J3—1 „G „G 



Due to Pq = (1 - c) Yln=o c "i where (1 - e -ip = 0-2 - |> we have 



Tr P0 P{x,y)P(z,w) 

Jaec/Jec 4tt 2 (ct- 1/2) 2 ^ 

= (l_ c )/ __ ±_. e 2(^-1/2) \ c " e -2l"l e - +a /3 " — -=e" l/31 

ia eC ,/3 eC 47r 2 (a-l/2) 2 ^ v^l 

_ /■ dW/3 te^d^zp^ - H'-ifl'+a^ y> (gag 

1 j iaEC,^c47r 2 (a- 1/2)2 A. n! 



1 V QeC ,/ 3eC 4 7 r 2 (a-l/2) 2 
Repeated application of the formula, J e _j4 ( x ~ B ) = leads to the equation l|24|l 
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6 Discussions 



We have given the first treatment of quanta singular statistical models. Namely, we proposed two 
kinds of quantum versions of HCRK inequality and Koike inequality. One is based on difference version 
of SLD, and the other is based on the difference version of RLD. The asymptotic theory based on RLD 
QHCRK inequality is also discussed. 

We applied those results to some examples. First example was estimation of entanglement measure, 
which is very simple but practically of some importance. We constructed asymptotically optimal estimate 
in this case. Second example is estimation of discrete parameter. It should be stressed that from the 
viewpoint of conventional state detection theory, the non-asymptotic analysis is very hard, while our 
approach is technically more tractable. Third example is estimation of vector valued parameter, and 
here we could observe the effect of non-commutativity. 

There are many unsolved problems. Especially, asymptotic theory is far from complete. For example, 
an asymptotic theory based on SLD QHCRK inequality is desired, for SLD-based lower bounds are 
better than those based on RLD, if the parameter is scalar. As for the piece-wise differentiable models, 
we can trivially extend the theory of differentiable by imposing asymptotic unbiasedness instead of yfn- 
unbiasedness on the estimator. However, SLD versions of Theorems and 0] are very hard to obtain, 
for we don't have the equivalence of (|14|) . Note that this difficulty is due to non-commutativity, or 
more specifically, the hardness of calculation of SLD. Hence, the similar problems never arise in classical 
estimation theory. 
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